HowNet Based Chinese Question Classification

نویسندگان

  • Dongfeng Cai
  • Jingguang Sun
  • Guiping Zhang
  • Dexin Lv
  • Yanju Dong
  • Yan Song
  • Chao Yu
چکیده

Question classification is the first step that Question Answering System must dispose, the precision of question classification greatly affect the subsequent processes. In this paper, we present a new question classification method which uses HowNet as the semantic resource to extract features, and we use Maximum Entropy Model to implement the method. The results validate the effectiveness of this method: the classification precision of coarse classes and fine classes reaches 92.18% and 83.86% respectively.

منابع مشابه

Automatic Recognition of Focus and Interrogative Word in Chinese Question for Classification

Question classification is one of the most important components in a question answering (QA) system. When there are fewer features in a question can be used for classification, the interrogative word and focus in question are critical features. Most previous studies in question classification used heuristic rules to identify the focus and interrogative word in question. In this paper, a statist...

متن کامل

Annotating Information Structures In Chinese Texts Using HowNet

This paper reported our work on annotating Chinese texts with information structures derived from HowNet. An information structure consists of two components: HowNet definitions and dependency relations. It is the unit of representation of the meaning of texts. This work is part of a multi-sentential approach to Chinese text understanding. An overview of HowNet and information structure are des...

متن کامل

HowNet and Its Computation of Meaning

The presentation will mainly cover (1) What is HowNet? HowNet is an on-line common-sense knowledgebase unveiling inter-conceptual relationships and interattribute relationships of concepts as connoting in lexicons of the Chinese and their English equivalents. (2) How it functions in the computation of meaning and as a NLP platform? The presentation will show 9 HowNet-based application tools. Al...

متن کامل

Chinese HowNet-Based Multi-factor Word Similarity Algorithm Integrated of Result Modification

In this paper, we firstly describe a novel approach to calculate the Chinese sememe similarity based on the HowNet hierarchical sememe tree. When we calculate the sememe similarity, we not only take Semantic Distance, Node Depth and Semantic Coincidence Degree into consideration, but also propose two impact factors named Node Environment Dense (NED) and Node Layer Ratio (NLR) to optimize the ca...

متن کامل

A Maximum Entropy Approach To HowNet-Based Chinese Word Sense Disambiguation

This paper presents a maximum entropy method for the disambiguation of word senses as defined in HowNet. With the release of this bilingual (Chinese and English) knowledge base in 1999, a corpus of 30,000 words was sense tagged and released in January 2002. Concepts meanings in HowNet are constructed by a closed set of sememes, the smallest meaning units, which can be treated as semantic tags. ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

متن کامل
عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006